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(54) Pitch marks management for speech synthesis 

(57) The distance between the first two pitch marks 
of a voiced portion of speech data to be processed is 
calculated. The difference between the adjacent inter- 
pitch-mark distances is calculated. The respective cal- 
culation results are stored and managed in a ffle. 
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Description 

BACKGROUND OF THE INVENTION 

[0001] The present invention relates to a speech syn- s 
thesis apparatus for performing speech synthesis by 
using pitch marks, a control method for the apparatus, 
and a computer-readable memory. 
[0002] Conventionally, processing that synchronizes 
with pitches has been performed as speech anafy- 10 
sis/synthesis processing and the like. For example, in a 
PSOLA (Pitch Synchronous OverLap Adding) speech 
synthesis method, synthetic speech is obtained by add- 
ing one-pitch speech waveform element pieces in syn- 
chronism with pitches. 15 
[0003] In this scheme, information (pitch mark) about 
the position of each pitch must be recorded concurrently 
with storage of speech waveform data. 
[0004] In the prior art described above, however, the 
size of a file on which pitch marks are recorded 20 
becomes undesirably large. 



dmax; 

second comparison means for comparing the 
length d with the minimum value dmin on the basis 
of the comparison result obtained by the first com- 
paring means; 

subtraction means for subtracting the maximum 
value dmax or minimum value dmin from the length 
d on the basis of the comparison results obtained 
by the first and second comparison means; and 
management means for storing the difference 
obtained by the subtraction means or the length d in 
the file and managing the difference or the length 
on the basis of the comparison results obtained by 
the first and second comparison means. 

[001 0] In order to achieve the above object, a speech 
synthesis apparatus according to the present invention 
has the following arrangement 
[0011] There is provided a speech synthesis appara- 
tus for performing speech synthesis by using pitch 
marks, comprising: 



SUMMARY OF THE INVENTION 

[0005] The present invention has been made in con- 2s 
sideration of the above problem, and has as its object to 
provide a speech synthesis apparatus capable of reduc- 
ing the size of a file used to manage pitch marks, a con- 
trol method therefor, and a computer-readable memory. 
[0006] In order to achieve the above object, a speech 30 
synthesis apparatus according to the present invention 
has the following arrangement 
[0007] There is provided a speech synthesis appara- 
tus for performing speech synthesis by using pitch 
marks, comprising: 35 

first calculation means for calculating a distance 

between first two pitch marks 

speech data to be processed; 

second calculation means for calculating a differ- 40 

ence between adjacent irrter-pitch-mark distances; 

and 

management means for storing the calculation 
results obtained by the first and second calculation 
means in a file and managing the results. 4s 

[0008] In order to achieve the above object, a speech 
synthesis apparatus according to the present invention 
has the following arrangement 

[0009] There is provided a speech synthesis appara- so 
tus for performing speech synthesis by using pitch 
marks, comprising: 

first comparison means for, when a length of 
speech data to be processed is represented by d, ss 
and a maximum value dmax and a minimum value 
dmin are defined for a predetermined word length, 
comparing the length d with the maximum value 



storage means for storing a file for managing a dis- 
tance between first two pitch marks of a voiced por- 
tion of speech data to be processed and a 
difference between adjacent inter-pitch-mark dis- 
tances; 

first loadng means for loading the distance 
between the first two pitch marks of the voiced por- 
tion; 

second loading means for loacfing the Difference 
between the adjacent inter-pitch-mark distances; 
and 

calculation means for calculating a next pitch mark 
position from a pitch mark position calculated 
immediately before the calculation, a pitch mark 
distance to an adjacent pitch mark, and the dis- 
tance and difference loaded by the first and second 
loacfing means. 

[0012] In order to achieve the above object a control 
method for a speech synthesis apparatus according to 
the present invention has the following steps. 
[0013] There is provided a control method for a 
speech synthesis apparatus for performing speech syn- 
thesis by using pitch marks, comprising : 

the first calculation step of calculating a distance 
between first two pitch marks of a voiced portion of 
speech data to be processed; 
the second calculation step of calculating a differ- 
ence between adjacent inter-pitch-mark distances; 
and 

the management step of storing the calculation 
results obtained in the first and second calculation 
steps in a file and managing the results. 

[0014] In order to achieve the above object a control 
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method for a speech synthesis apparatus according to 
the present invention has the following steps. 
[00151 There is provided a control method for a 
speech synthesis apparatus for performing speech syn- 
thesis by 5 

using pitch marks, comprising: the first comparison 
step of, when a length of speech data to be proc- 
essed is represented by d, and a maximum value 
dmax and a minimum value dmin are defined for a w 
predetermined word length, comparing the length d 
with the maximum value dmax; the second compar- 
ison step of comparing the length d with the mini- 
mum value dmin on the basis of the comparison 
result obtained in the first comparing step; is 
the subtraction step of subtracting the maximum 
value dmax or minimum value dmin from the length 
d on the basis of the comparison results obtained in 
the first and second comparison steps; and 
the management step of storing the difference 20 
obtained in the subtraction step or the length d in 
the ffle and managing the difference or the length 
on the basis of the comparison results obtained in 
the first and second comparison steps. 

25 

[0016] In order to achieve the above object, a control 
method for a speech synthesis apparatus according to 
the present invention has the following steps. 
[0017] There is provided a control method for a 
speech synthesis apparatus for performing speech syrv 30 
thesis by using pitch marks, comprising: 

the storage step of storing a file for managing a dis- 
tance between first two pitch marks of a voiced por- 
tion of speech data to be processed and a 35 
difference between adjacent inter-pitch-mark dis- 
tances; 

the first loading step of loading the distance 
between the first two pitch marks of the voiced por- 
tion; 40 
the second loading step of loading the difference 
between the adjacent inter-pitch-mark distances; 
and 

the calculation step of calculating a next pitch mark 
position from a pitch mark position calculated 45 
immediately before the calculation, a pitch mark 
distance to an adjacent pitch mark, and the dis- 
tance and difference loaded in the first and second 
loading steps. 

50 

[0018] In order to achieve the above object a compu- 
ter-readable memory according to the present invention 
has the following program codes. 
[001 9] There is provided a computer-readable mem- 
ory storing prog^m codes for controlling a speech syn- ss 
thesis apparatus for performing speech synthesis by 
using pitch marks, comprising: 



a program code for the first calculation step of cal- 
culating a distance between first two pitch marks of 
a voiced portion of speech data to be processed; 
a program code for the second calculation step of 
calculating a difference between adjacent inter- 
pitch-mark distances; and 
a program code for the management step of storing 
the calculation results obtained in the first and sec- 
ond calculation steps in a file and managing the 
results. 

[0020] In order to achieve the above object, a compu- 
ter-readable memory according to the present invention 
has the following program codes. 
[0021 ] There is provided a computer-reliable mem- 
ory storing program codes for controlling a speech syn- 
thesis apparatus for performing speech synthesis by 
using pitch marks, comprising: 

a program code for the first comparison step of, 
when a length of speech data to be processed is 
represented by d, and a maximum value dmax and 
a minimum value dmin are defined for a predeter- 
mined word length, comparing the length d with the 
maximum value dmax; 

a program code for the second comparison step of 
comparing the length d with the minimum value 
dmin on the basis of the comparison result obtained 
in the first comparing step; 
a program code for the subtraction step of subtract- 
ing the maximum value dmax or minimum value 
dmin from the length d on the basis of the compari- 
son results obtained in the first and second compar- 
ison steps; and 

a program code for the management step of storing 
the difference obtained in the subtraction step or 
the length din the file and managing the difference 
or the length on the basis of the comparison results 
obtained in the first and second comparison steps. 

[Q022] In order to achieve the above object, a compu- 
ter-readable memory according to the present invention 
has the following program codes. 
[0023] There is provided a computer-readable mem- 
ory storing program codes for controlling a speech syn- 
thesis apparatus for performing speech synthesis by 
using pitch marks, comprising: 

a program code for the storage step of storing a file 
for managing a distance between first two pitch 
marks of a voiced portion of speech data to be 
processed and a difference between adjacent inter- 
pitch-mark distances; 

a program code for the first loading step of loading 
the distance between the first two pitch marks of the 
voiced portion; 

a program code for the second loading step of load- 
ing the difference between the adjacent irrter-pitch- 
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mark distances; and 

a program code for the calculation step of calculat- 
ing a next pitch mark position from a pitch mark 
position calculated immediately before the calcula- 
tion, a pitch mark distance to an adjacent pitch s 
mark, and the distance and difference loaded in the 
first and second loading steps. 

[0024] Other features and advantages of the present 
invention wfll be apparent from the following description io 
taken in conjunction with the accompanying drawings, 
in which like reference characters designate the same 
or similar parts throughout the figures thereof. 

BRIEF DESCRIPTION OF THE DRAWINGS 15 

[0025] 

Fig. 1 is a block diagram showing the arrangement 
of a speech synthesis apparatus according to the 20 
first embodiment of the present invention; 
Fig. 2 is a flow chart showing pitch mark data file 
generation processing executed in the first embodi- 
ment of the present invention; 
Fig. 3 is a view for explaining pitch marks in the first 25 
embodiment of the present invention; 
Fig. 4 is a flow chart showing another example of 
the pitch mark data file generation processing exe- 
cuted in the first embodiment of the present inven- 
tion; 30 
Fig. 5 is a flew chart showing another example of 
the processing of recording the pitch marks of a 
voiced portion in the first embodiment of the 
present invention; 

Fig- 6 is a flow chart showing pitch mark data ffle 35 
loading processing executed in the second embod- 
iment of the present invention; and 
Fig. 7 is a flow chart shewing another example of 
the processing of loading the pitch marks of a 
voiced portion in the second embodiment of the 40 
present invention. 

DETAILED DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

45 

[First Embodiment] 

[0026] Fig. 1 is a block diagram showing the arrange- 
ment of a speech synthesis apparatus according to the 
first embodiment of the present invention. 50 
[0027] Reference numeral 103 denotes a CPU for per- 
forming numerical operation/control, control on the 
respective components of the apparatus, and the like, 
which are executed in the present invention; 102, a 
RAM serving as a work area for processing executed in ss 
the present invention, a temporary saving area for vari- 
ous data and having an area for storing a pitch mark 
data file 101a; 101 , a ROM storing various control pro- 



grams such as programs executed in the present inven- 
tion, for managing pitch mark data used for speech 
synthesis; 109, an external storage unit serving as an 
area for storing processed data; and 105, a D/A con- 
verter for converting the digital speech data synthesized 
by the speech synthesis apparatus into analog speech 
data and outputb'ng it from a loudspeaker 1 1 0. 
[0028] Reference numeral 1 06 denotes a display con- 
trol unit for controlling a display 1 1 1 when the process- 
ing state and processing results of the speech synthesis 
apparatus, and a user interface are to be displayed; 
107, an input control unit for recognizing key information 
input from a keyboard 112 and executing the desig- 
nated processing; 108, a communication control unit for 
controlling transmission/reception of data through a 
communication network 113; and 104, a bus for con- 
necting the respective components of the speech syn- 
thesis apparatus to each other. 
[0029] Pitch mark data file generation processing exe- 
cuted in the first embodiment will be described next with 
reference to Fig. 2. 

[0030] Fig. 2 is a flow chart showing pitch mark data 
fOe generation processing executed in the first embodi- 
ment of the present invention. 

[0031] As shewn in Fig. 3, pitch marks Pi, P 2 Pj. 

Ph.1 are arranged in each voiced portion at certain inter- 
vals, but no pitch mark is present in any unvoiced por- 
tion. 

[0032] First of all, it is checked in step S1 whether the 
first segment of speech data to be processed is a voiced 
or unvoiced portion. If it is determined that the first seg- 
ment is a voiced portion (YES in step S1), the flow 
advances to step S2. If it is determined that the first seg- 
ment is an unvoiced portion (NO in step S1), the flow 
advances to step S3. 

[0033] In step S2, voiced portion start information indi- 
cating that The first segment is a voiced portion" is 
recorded. In step S4, a first inter-ptch-mark distance 
(distance between the first pitch mark p 1 and the sec- 
ond pitch mark pg of the voiced portion) d 1 is recorded 
in the pitch mark data file 101a. In step S5, the value of 
a loop counter i is initialized to 2. 
[0034] It is then checked in step S6 whether the voiced 
portion ends with the ith pitch mark p; indcated by the 
value of the loop counter i. If it is determined that the 
voiced portion does not end with the pitch mark pj (NO 
in step S6), the flow advances to step S7 to obtain the 
difference (dj - d^) between an inter-pitch-mark rJs- 
tance dj and an inter-pitch-mark distance d M . In step 
S8, the obtained difference (dj - d M ) is recorded in the 
pitch mark data file 101a. In step S9, the loop counter i 
is incremented by 1 , and the flow returns to step S6. 
[0035] If it is determined that the voiced portion ends 
(YES in step S6). the flow advances to step S10 to 
record a voiced portion end signal indicating the end of 
the voiced portion in the pitch mark data file 101a. Note 
that any signal can be used as the voiced portion end 
signal as long as it can be discriminated from an inter- 
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pitch-mark cfistance. In step S11, it is checked whether 
the speech data has ended. If it is determined that the 
speech data has not ended (NO in step S11), the flow 
advances to step S1 2. If it is determined that the speech 
data has ended (YES in step S1 1 ), the processing is ter- 5 
minated. 

[0036] It is determined in step S1 that the first seg- 
ment of the speech data is an unvoiced portion (NO in 
step S1), the flow advances to step S3 to record 
unvoiced portion start information indicating that "the 
first segment is an unvoiced portion" in the pitch mark 
data file 101a. In step S12, a distance ds between the 
voiced portion and the next voiced portion (i.e., the 
length of the unvoiced portion) is recorded in the pitch 
mark data file 101a In step S13, it is checked whether 
the speech data has ended, If it is determined that the 
speech data has not ended (NO in step S13). the flow 
advances to step S4. If rt is determined that the speech 
data has ended (YES in step S 1 3), the processing is ter- 
minated. 

[0037] As described above, according to the first 
embodiment, since the respective pitch marks in each 
voiced portion are managed by using the distances 
between the adjacent pitch marks, all the pitch marks in 
each voiced portion need not be managed. This can 
recfejce the size of the pitch mark data file 101a. 
[0038] In the first embodiment step S10 may be 
replaced wfth step S14 of counting the number (n) of 
pitch marks in each voiced portion and step S15 of 
recording the counted number n of pitch marks in the 
pitch mark data fie 101a, as shown in Fig. 4. In this 
case, the processing in step S6 amounts to checking 
whether the value of the loop counter i is equal to the 
number n of pitch marks. 

[0039] Another example of the processing of record- 
ing pitch marks of each voiced portion in the first 
embodiment wfll be described with reference to Rg. 5. 
[0040] Fig. 5 is a flow chart showing another example 
of the processing of recording pitch marks of each 
voiced portion in the first embodiment of the present 
invention. 

[0041] For example, the data length of speech data to 
be processed is represented by d, and a maximum 
value dmax (ag., 127) and a minimum value dmin (ag.. 
-127) are defined for a gpven word length (ag., 8 bits). 4s 
[0042] First of all. in step S16, d is compared with 
dmax. tf d is equal to or larger than dmax (YES in step 
S16), the flow advances to step S1 7 to record the max- 
imum value dmax in the pitch mark data file 101a. In 
step S18, dmax is subtracted from d, and the flow so 
returns to step S16. If it is determined that d is smaller 
than dmax (NO in step S16), the flow advances to step 
S19. 

[0043] In step S19, d is compared with dmin. tf d is 
equal to or smaller than dmin (YES in step S19), the ss 
flow advances to step S20 to record the minimum value 
dmin in the pitch mark data file 101a. In step S21 , dmin 
is subtracted from d. and the flow returns to stepS19. If 



it is determined that d is larger than dmin (NO in step 

S19), the flow advances to step S22 to record d. The 

processing is then terminated. 

[0044] With this recording, for example, dmin-1 (-128 

in the above case) can be used as a voiced portion end 

signal. 

[Second Embodiment] 

[0045] In the second embodiment, pitch mark data file 
loading processing of loading data from the pitch mark 
data file 101a recorded in the first embodiment will be 
descrbed with reference to Fig. 6. 
[0046] Rg. 6 is a flow chart showing pitch mark data 
ffle loading processing executed in the second embodi- 
ment of the present invention. 
[0047] Rrst of all. in step S23, start information indi- 
cating whether the start of speech data to be processed 
is a voice or unvoiced portion, is loaded from a pitch 
mark data file 101a. It is then checked in step S24 
whether the loaded start information is voiced portion 
start information. H voiced portion start information is 
determined (YES in step $24), the flow advances to 
step S25 to load a first inter-pitch-mark distance (dis- 
tance between a first pitch mark p 1 and a second pitch 
mark p^ of the voiced portion) d 1 from the pitch mark 
data fie 101a. Note that the second pitch mark P2 is 
located at p^^ 

[0048] tn step S26. the value of a loop counter i is ini- 
tialized to 2. In step S27. a difference d r (data corre- 
sponding the length of one word) from the pitch mark 
data file 101a In step S28, it is checked whether the 
loaded differenced,, is a voiced portion end signal, ff it is 
determined that the difference is not a voiced portion 
end signal (NO in step S28), the flow advances to step 
S29 to calculate a next inter-pitch-mark distance dj and 
pitch mark position p^ from a pitch mark position pi, 
inter-pitch-mark distance d^.and d r obtained in the 
past 

[0049] The following equations can be formulated 
from pi, d^, d„ dj, and p^i- The next inter-pitch-mark 
cfistance dj and pitch mark position p k1 can be calcu- 
lated by using these equations. 

d,-d w +d r (1) 

PM.i = Pi + d| (2) 

[0050] In step S30, the loop counter i is incremented 
by 1 . The flow then returns to step S27. 
[0051] If it is determined that d r is a voiced portion end 
signal (YES in step S28). the flow advances to step S31 
to check whether the speech data has ended, ff it is 
determined that the speech data has not ended (NO in 
step S31), the flow advances to step S32. ff it is deter- 
mined that the speech data has ended (YES in step 
S31), the processing is terminated. 
[0052] ff it is determined in step S24 that the loaded 
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information is not voiced portion start information (NO in 
step S24), the flow advances to step S32 to load a dis- 
tance ds to the next voiced portion from the pitch mark 
data file 101a. It is then checked in step S33 whether 
the speech data has ended. If it is determined that the 5 
speech data has not ended (NO in step S33), the flow 
advances to step S25. If it is determined that the speech 
data has ended (YES in step S33), the processing is ter- 
minated. 

[0053] As descrtoed above, according to the second 10 
embodiment, since pitch marks can be loaded by using 
the pitch mark data file 101a managed by the process- 
ing described in the first embodiment, the size of data to 
be processed decreases to improve the processing effi- 
ciency, is 
[0054] Another example of the processing of loacfing 
pitch marks of each voiced portion in the second 
embodiment will be descrbed with reference to Fig. 7. 
[0055] Fig. 7 is a flow chart showing another example 
of the processing of loading pitch marks of each voiced 20 
portion in the second embodiment of the present inven- 
tion. 

[0056] Assume that the data length information of 
loaded speech data is stored in a register d, and a max- 
imum value dmax (e.g., 127), a minimum value dmin ss 
(e.g, -127), and a voiced portion end signal are defined 
for a given word length (ag., 8 bits) in Fig. 5. 
[0057] First of all, in step S34, the register d is initial- 
ized to 0. In step S35, the data d r corresponding the 
length of one word is loaded from the pitch mark data 30 
file 101a. It is then checked h step S36 whether dr is a 
voiced portion end signal. If it is determined that the d, 
is a voiced portion end signal (YES in step S36). the 
processing is terminated. If it is determined that d r is not 
a voiced portion end signal (NO in step S36), the flow 35 
advances to step S37 to add d r to the contents of the 
register d. 

[0058] In step S38, it is checked whether oV is equal to 
dmax or dmin. If it is determined that they are equal 
(YES in step S38), the flow returns to step S35. If it is 40 
determined that they are not equal (NO in step S38), the 
processing is terminated. 

[0059] Note that the present invention may be applied 
to either a system constituted by a pturafity of equip- 
ments (e.g., a host computer, an interface device, a 45 
reader, a printer, and the tike), or an apparatus consist- 
ing of a single equipment (e.g., a copying machine, a 
facsimile apparatus, or the like). 
[0060] The objects of the present invention are also 
achieved by supplying a storage medium, which records so 
a program code of a software program that can realize 
the functions of the above-mentioned embodiments to 
the system or apparatus, and reading out and executing 
the program code stored in the storage medium by a 
computer (or a CPU or MPU) of the system or appara- ss 
tus. 

[0061] In this case, the program code itsetf read out 
from the storage medium realizes the functions of the 



above-mentioned embodiments, and the storage 
medium which stores the program code constitutes the 
present invention. 

[0062] As the storage medium for supplying the pro- 
gram code, for example, a floppy disk, hard disk, optical 
disk, magneto-optical disk, CD-ROM, CD-R, magnetic 
tape, nonvolatile memory card, ROM, and the like may 
be used. 

[0063] The functions of the above-mentioned embod- 
iments may be realized not only by executing the read- 
out program code by the computer but also by some or 
all of actual processing operations executed by an OS 
(operating system) running on the computer on the 
basis of an instruction of the program code. 
(0064] Furthermore, the functions of the above-men- 
tioned embodiments may be realized by some or all of 
actual processing operations executed by a CPU or the 
like arranged in a function extension board or a function 
extension unit, which is inserted in or connected to the 
computer, after the program code read out from the 
storage medium is written in a memory of the extension 
board or unit 

[0065] Further, the program code can be obtained in 
electronic form for example by downloading the code 
over a network such as the internet Thus in accordance 
with another aspect of the present invention there is 
provided an electrical signal carrying processor imple- 
mentable instructions for controlling a processor to 
carry out the method as hereinbefore described. 
[0066] As many apparently widely drfferent embodi- 
ments of the present invention can be made without 
departing from the spirit and scope thereof, it is to be 
understood that the invention is not Dmited to the spe- 
cific embodiments thereof except as defined in the 
appended claims. 

Claims 

1. A speech synthesis apparatus for performing 
speech synthesis by using pitch marks, character- 
ized by comprising: 

first calculation means (103) for calculating a 
distance between first two pitch marks of a 
voiced portion of speech data to be processed; 
second calculation means (103) for calculating 
a difference between adjacent inter-prtch-mark 
distances; and 

management means (102) for storing the cal- 
culation results obtained by said first and sec- 
ond calculation means in a file and managing 
the results. 

2. The apparatus according to claim 1 , characterized 
in that said management means further calculates 
an inter-voiced-portion distance as a distance 
between voiced portions on both sides of an 
unvoiced portion, stores the distance in the file, and 
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manages the distance. 

3. The apparatus according to claim 1 , characterized 
by further comprising counting means for counting 
the number of pitch marks of the voiced portion, 
and 

when the number of pitch marks is counted 
by said counting means, said management means 
stores the number of pitch marks in the fie and 
manages the number of pitch marks. 

4. A speech synthesis apparatus for performing 
speech synthesis by using pitch marks, character- 
ized by comprising: 

first comparison means (103) for, when a 
length of speech data to be processed is repre- 
sented by d, and a maximum valuedmaxanda 
minimum value dmin are defined for a predeter- 
mined word length, comparing the length d with 
the maximum value dmax; 
second comparison means (103) for compar- 
ing the length d with the minimum value dmin 
on the basis of the comparison result obtained 
by said first comparing means; 
subtraction means (103) for subtracting the 
maximum value dmax or minimum value dmin 
from the length d on the basis of the compari- 
son results obtained by said first and second 
comparison means; and 
management means (1 02) for storing the differ- 
ence obtained by said subtraction means or the 
length d in the file and managing the difference 
or the length on the basis of the comparison 
results obtained by said first and second com- 
parison means. 

5. The apparatus according to claim 4, characterized 
in that said subtraction means subtracts the maxi- 
mum value dmax from the length d when the com- 
parison result obtained by said first comparison 
means indicates that the length d is not less than 
the maximum value dmax, and subtracts the mini- 
mum value dmin from the length d when the com- 
parison resutt obtained by satf second 

means indicates that the length d is not more than 
the minimum value dmin. 

6. A speech synthesis apparatus for performing 
speech synthesis by using pitch marks, character- 
ized by comprising: 

storage means (102) tor storing a He for man- 
aging a distance between first two pitch marks 
of a voiced portion of speech data to be proc- 
essed and a difference between adjacent inter- 
pitch-mark distances; 

first loading means (103) for loading the dis- 



tance between the first two pitch marks of the 
voiced portion; 

second loading means (103) for loading the dif- 
ference between the adjacent imer-prtch-mark 

5 distances; and 

calculation means (103) for calculating a next 
pitch mark position from a pitch mark position 
calculated immediately before the calculation, 
a pitch mark distance to an adjacent pitch 

ro mark, and the cfistance and cfifference loaded 

by said first and second loading means. 

7. The apparatus according to claim 6, characterized 
in that in the file stored in said storage means, a dis- 

is tance between voiced portions on both sides of an 
unvoiced portion is managed, and 

said calculation means loads the distance 
between the voiced portions on both sides of the 
unvoiced portion when processing is to be per- 

20 formed for the next voiced portion. 

8. The apparatus according to claim 6, characterized 
in that when a data length of data to be processed 
is held, and a maximum value dmax and a minimum 

25 value dmin are defined for a predetermined word 
length, fixed-length data d r is also managed in the 
fie stored in said storage means, and 

it is checked whether a value obtained by 
load ng the fixed-l ength data d r and adding d to the 

30 data oV is equal to the maximum value dmax or the 
minimum value dmin, and the fixed-length data d r is 
loaded when the value is equal to the maximum 
value dmax or the minimum value dmin. 

35 9. A control method for a speech synthesis apparatus 
for performing speech synthesis by using pitch 
marks, characterized by comprising: 

a first calculation step (S4) of calculating a dis- 
40 tance between first two pitch marks of a voiced 

portion of speech data to be processed; 
a second calculation step (S7) of calculating a 
difference between adjacent irner-prtch-mark 
distances; and 

45 a management step (S8) of storing the calcula- 

tion results obtained in said first and second 
calculation steps in a file and managing the 
results. 

so 10. The method according to claim 9, characterized in 
that said management step further comprises cal- 
culating an inter-voiced-portion distance as a dis- 
tance between voiced portions on both sides of an 
unvoiced portion, storing the distance in the file, 

55 and managing the distance. 

11. The method according to claim 9, further compris- 
ing a counting step of counting the number of pitch 
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marks of the voiced portion, and 

when the number of pitch marks is counted 
in said counting step, said management step com- 
prises storing the number of pitch marks in the ffle 
and manages the number of pitch marks. 5 

1 2. A control method for a speech synthesis apparatus 
for performing speech synthesis by using pitch 
marks, characterized by comprising: 

10 

a first comparison step (S16) of, when a length 
of speech data to be processed is represented 
by d, and a maximum value dmax and a mini- 
mum value dmin are defined for a predeter- 
mined word length, comparing the length d with is 
the maximum value dmax; 
a second comparison step (S19) of comparing 
the length d with the minimum value dmin on 
the basts of the comparison result obtained in 
said first comparing step; 20 
a subtraction step (S18, S21) of subtracting the 
maximum value dmax or minimum value dmin 
from the length d on the basis of the compari- 
son results obtained in said first and second 
comparison steps; and 2$ 
a management step (S22) of storing the differ- 
ence obtained in the subtraction step or the 
length d in the file and managing the difference 
or the length on the basis of the comparison 
results obtained in said first and second com- 30 
parison steps. 

13. The method according to claim 12, characterized in 
that said subtraction step comprises subtracting the 
maximum value dmax from the length d when the 35 
comparison result obtained in said first comparison 
step indicates that the length d is not less than the 
maximum value dmax, and subtracting the mini- 
mum value dmin from the length d when the com- 
parison result obtained in said second comparison 40 
step indicates that the length d is not more than the 
minimum value dmin. 

14. A control method for a speech synthesis apparatus 

for performing speech synthesis by using pitch 45 
marks, characterized by comprising: 

a storage step of storing (S23) a He for manag- 
ing a distance between first two pitch marks of 
a voiced portion of speech data to be proc- so 
essed and a difference between adjacent irrter- 
prtch-mark distances; 

a first loading step (S25) of loading the dis- 
tance between the first two pitch marks of the 
voiced portion; 55 
a second loading step (S27) of loacfing the dif- 
ference between the adjacent inter-pitch-mark 
distances; and 



a calculation step (S29) of calculating a next 
pitch mark position from a pitch mark position 
calculated immediately before the calculation, 
a pitch mark distance to an adjacent pitch 
mark, and the tistance and cfifference loaded 
in said first and second loading steps. 

15. The method according to claim 14, characterized in 
that in the file stored in said storage step, a distance 
between voiced portions on both sides of an 
unvoiced portion is managed, and 

a calculation step comprises loading the ds- 
tance between the voiced portions on both sides of 
the unvoiced portion when processing is to be per- 
formed for the next voiced portion. 

16. The method according to claim 14, characterized 
by fixed-length data d r in the file stored in said stor- 
age step when a data length of data to be proc- 
essed is held, and a maximum value dmax and a 
minimum value dmin are defined for a predeter- 
mined word length, and 

a step of checking whether a value obtained 
by loading the fixed-length data oV and adding d to 
the data d r is ecpja! to the maximum value dmax or 
the minimum value dmin, and loading the fixed- 
length data oV when the value is equal to the maxi- 
mum value dmax or the minimum value dmin. 

17. A computer-readable memory storing program 
codes for controlling a speech synthesis apparatus 
for performing speech synthesis by using pitch 
marks, characterized by comprising: 

a program code for the first calculation step of 
calculating a distance between first two pitch 
marks of a voiced portion of speech data to be 
processed; 

a program code for the second calculation step 
of calculating a difference between adjacent 
inter-prtch-mark distances; and 
a program code for the management step of 
storing the calculation results obtained in the 
first and second calculation steps in a ffle and 
managing the results. 

1& A computer-readable memory storing program 
codes for controlling a speech synthesis apparatus 
for performing speech synthesis by using pitch 
marks, characterized by comprising: 

a program code for the first comparison step of. 
when a length of speech data to be processed 
is represented by d, and a maximum value 
dmax and a minimum value dmin are defined 
for a predetermined word length, comparing 
the length d with the maximum value dmax; 
a program code for the second comparison 
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step of comparing the length d with the mini- 
mum value dmin on the basis of the compari- 
son result obtained in said first comparing step; 
a pro&am code for the subtraction step of sub- 
tracting the maximum value dmax or minimum s 
value dmin from the length d on the basis of the 
comparison results obtained in said first and 
second comparison steps; and 
a program code for the management step of 
storing the difference obtained in the subtrac- 10 
tion step or the length d in the file and manag- 
ing the difference or the length on the basts of 
the comparison results obtained in said first 
and second comparison steps. 

15 

19. A computer-readable memory storing program 
codes for controlling a speech synthesis apparatus 
for performing speech synthesis by using pitch 
marks, characterized by comprising: 

20 

a program code for the storage step of storing 
a f Oe for managing a distance between first two 
pitch marks of a voiced portion of speech data 
to be processed and a difference between 
adjacent inter-pitch-mark cfistances; zs 
a program code for the first loading step of 
loading the distance between the first two pitch 
marks of the voiced portion; 
a program code for the second loading step of 
loading the difference between the adjacent 30 
inter-pitch-mark distances; and 
a program code for the calculation step of cal- 
culating a next pitch mark position from a pitch 
mark position calculated immediately before 
the calculation, a pitch mark distance to an 35 
adjacent pitch mark, and the distance and dif- 
ference loaded in said first and second loading 
steps. 

20. A method of compressing data representative of 40 
pitch mark information for use in determining pitch 
information to be combined with speech waveform 
elements in a method of speech synthesis, the pitch 
mark information being in the form of a series of 
position data values representing the timing of pitch 45 
information relative to the speech waveforms, the 
method comprising; 

(a) converting the series of position data values 

to distance data comprising a series of inter- so 
pitch mark distances each representing a dis- 
tance between adjacent positions; 

(b) calculating a series of cffference values 
between the magnitudes of adjacent inter-pitch 
mark distances in the series of inter-pitch mark 55 
distances; and 

(c) outputting a set of output data comprising 
the value of a first inter-pitch mark distance in 



the series of inter-pitch mark distances and the 
difference values of the series of difference val- 
ues. 

21. An electrical signal carrying processor implementa- 
ble instructions for controlling a processor to carry 
of the method of any one of claims 9 to 16 and 20. 
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